Low precision types support in Convert operation #5640

jdanieck · 2021-05-14T16:20:50Z

Details:

Add u1 type support + tests
Add u4 type support + tests
Add i4 type support + tests
Extend Serialization SLT with LP types
Add Convert to summarize.py report

Tickets:

46096

jdanieck · 2021-05-20T06:31:56Z

@ilyachur @jane-intel @pelszkow guys, I think we should introduce general support for LP types in ngraph. We should consider introducing an ngraph::u1 type and vector<ngraph::u1> (and the same for u4 and i4) where we could encapsulate all this "binary magic". We would also be able to dispatch templates on these types and avoid "if(LP_TYPE)" tricks like the ones in this PR. Please note that I had to repeat the LP type handling code from Constant here, and who knows how many times we'll need it in the future for other operations. What do you think?
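To make the template-dispatch idea concrete, here is a minimal sketch. The tag types `u1`, `u4`, `i4` and the `load` helper in the `sketch` namespace are hypothetical and are not existing ngraph APIs. The bit layout (element 0 in the least significant bits of byte 0) is also an assumption and may not match the layout ngraph actually uses.

```cpp
#include <cstddef>
#include <cstdint>

namespace sketch {
// Hypothetical tag types describing each low precision element type.
struct u1 { static constexpr int bits = 1; static constexpr bool is_signed = false; };
struct u4 { static constexpr int bits = 4; static constexpr bool is_signed = false; };
struct i4 { static constexpr int bits = 4; static constexpr bool is_signed = true; };

// Reads element `index` from a packed buffer. The element type is a template
// parameter, so no runtime "if (LP_TYPE)" branch on the type is needed.
template <typename T>
std::int8_t load(const std::uint8_t* mem, std::size_t index) {
    constexpr int per_byte = 8 / T::bits;
    constexpr unsigned mask = (1u << T::bits) - 1u;
    const int shift = static_cast<int>(index % per_byte) * T::bits;
    const unsigned raw = (mem[index / per_byte] >> shift) & mask;
    // Sign-extend when the type is signed and the top bit of the element is set.
    if (T::is_signed && (raw & (1u << (T::bits - 1)))) {
        return static_cast<std::int8_t>(static_cast<int>(raw) - (1 << T::bits));
    }
    return static_cast<std::int8_t>(raw);
}
}  // namespace sketch
```

With a buffer holding the single byte `0xF3`, `sketch::load<sketch::u4>` and `sketch::load<sketch::i4>` read the same high nibble as 15 and -1 respectively, which is the kind of per-type behaviour that otherwise ends up in repeated branches.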

ilyachur · 2021-05-20T09:24:06Z

The LP types were introduced to minimize the size of weights. If we work with a vector of u1 or other LP types, the vector will contain holes (for u1, each element in the vector occupies 1 byte, not 1 bit), so memory consumption will increase.
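The size difference can be checked with simple arithmetic. The helpers below are hypothetical names used only for illustration, and they assume elements are packed back to back with the total rounded up to a whole byte.

```cpp
#include <cstddef>

// Bytes needed to store `count` elements of `bits` bits each when packed
// back to back, rounded up to a whole byte.
constexpr std::size_t packed_bytes(std::size_t count, std::size_t bits) {
    return (count * bits + 7) / 8;
}

// Bytes needed when every element occupies a full byte,
// as it would in a plain std::vector<uint8_t>.
constexpr std::size_t unpacked_bytes(std::size_t count) {
    return count;
}

static_assert(packed_bytes(1024, 1) == 128, "u1: 8 elements per byte");
static_assert(packed_bytes(1024, 4) == 512, "u4/i4: 2 elements per byte");
static_assert(unpacked_bytes(1024) == 1024, "one byte per element");
```

Under these assumptions, storing u1 data one element per byte costs 8 times the packed size, and u4 or i4 data costs 2 times.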

jdanieck · 2021-05-20T09:37:20Z

I was thinking about a specialization like the one for vector of bool, which would allow us to keep the same memory usage.

My main point is that we should have a common place for the code that handles LP types in ngraph; we can discuss implementation details once we agree that it's needed. If you want to think about it in terms of a concrete use case, look at the TestCase class and consider how to properly introduce LP type handling there (at this point they are not handled).

ilyachur · 2021-05-20T12:03:40Z

Yeah, I know about the vector of bool, but unfortunately we also have the u4 and i4 LP types, and for those we don't have a good solution.

jdanieck · 2021-05-20T12:13:07Z

I am really bad at communicating today :) I think it is technically possible to implement our own specialization of vector, e.g. vector<ngraph::u4> and vector<ngraph::i4>, and follow the same implementation concept as vector<bool> to decrease memory usage. @pelszkow could you confirm it is technically possible? I think we should at least consider it if it is technically possible, and if we conclude it is too much work then we can reject it.

pelszkow · 2021-05-20T12:20:00Z

It looks like we should provide a tool that is a mix of std::vector<bool> and std::span. This class should allow iterating over the buffer/memory and accessing LP elements the way std::vector<bool> does with bits/bools.
e.g.:

// Forward declarations so the types can refer to each other.
template<int BitsNo>
struct BitWrapper;

template<int BitsNo>
struct SpanIterator;

// Non-owning view over a packed buffer of BitsNo-wide elements.
template<int BitsNo>
struct Span {
    using Iterator = SpanIterator<BitsNo>;
    BitWrapper<BitsNo> operator[](size_t i);
    Iterator begin();
    Iterator end();
    //....
};

template<int BitsNo>
struct SpanIterator {
    BitWrapper<BitsNo> operator*();
    SpanIterator& operator++();
    //....
};

// Proxy for a single packed element, like std::vector<bool>::reference.
template<int BitsNo>
struct BitWrapper {
    BitWrapper(uint8_t* mem, int index);
    operator uint8_t() const;              // do bit magic
    BitWrapper& operator=(uint8_t value);  // do bit magic
    //....
};
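One way the "bit magic" could be filled in is sketched below. This is an illustration, not the implementation that was later added to ngraph. It assumes BitsNo is 1, 2, or 4, that element 0 occupies the least significant bits of byte 0, and that values are unsigned. The member names are hypothetical, and i4 would additionally need sign extension on read.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Proxy for one BitsNo-wide element inside a packed byte buffer.
template <int BitsNo>
class BitWrapper {
    static_assert(BitsNo == 1 || BitsNo == 2 || BitsNo == 4, "BitsNo must divide 8");
    static constexpr int per_byte = 8 / BitsNo;
    static constexpr std::uint8_t mask = static_cast<std::uint8_t>((1u << BitsNo) - 1u);

public:
    BitWrapper(std::uint8_t* mem, std::size_t index)
        : m_byte(mem + index / per_byte),
          m_shift(static_cast<int>(index % per_byte) * BitsNo) {}

    // Read: shift the element down and mask off its neighbours.
    operator std::uint8_t() const {
        return static_cast<std::uint8_t>((*m_byte >> m_shift) & mask);
    }

    // Write: clear the element's bits, then OR in the truncated new value.
    BitWrapper& operator=(std::uint8_t value) {
        *m_byte = static_cast<std::uint8_t>((*m_byte & ~(mask << m_shift)) |
                                            ((value & mask) << m_shift));
        return *this;
    }

    // Assigning one proxy to another copies the value, not the location.
    BitWrapper& operator=(const BitWrapper& other) {
        return *this = static_cast<std::uint8_t>(other);
    }

private:
    std::uint8_t* m_byte;
    int m_shift;
};

// Non-owning view over `count` packed elements starting at `mem`.
template <int BitsNo>
class Span {
public:
    class Iterator {
    public:
        Iterator(std::uint8_t* mem, std::size_t i) : m_mem(mem), m_i(i) {}
        BitWrapper<BitsNo> operator*() const { return BitWrapper<BitsNo>(m_mem, m_i); }
        Iterator& operator++() { ++m_i; return *this; }
        bool operator!=(const Iterator& other) const { return m_i != other.m_i; }

    private:
        std::uint8_t* m_mem;
        std::size_t m_i;
    };

    Span(std::uint8_t* mem, std::size_t count) : m_mem(mem), m_count(count) {}
    std::size_t size() const { return m_count; }
    BitWrapper<BitsNo> operator[](std::size_t i) { return BitWrapper<BitsNo>(m_mem, i); }
    Iterator begin() { return Iterator(m_mem, 0); }
    Iterator end() { return Iterator(m_mem, m_count); }

private:
    std::uint8_t* m_mem;
    std::size_t m_count;
};
```

A `Span<4>` over a two-byte buffer exposes four elements. Writes such as `s[1] = 0xA` touch only the relevant nibble, and `for (std::uint8_t v : s)` reads each element through the proxy's conversion operator. Because the view does not own the memory, it can wrap the existing packed Constant data without copying or increasing memory use.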

ilyachur · 2021-05-20T12:21:17Z

Oh, ok, now I got your point. So in this case, I think we need to try to implement it.

jdanieck · 2021-05-20T14:17:01Z

Ok, so just to be on the same page, I'll create a separate ticket for this. Let's merge this PR as is, including your comments of course.

jdanieck · 2021-05-21T09:44:19Z

@ilyachur any chance you will review today? I'd love to merge it before FF :)

ngraph/test/backend/convert.in.cpp

ilyachur · 2021-05-21T10:12:36Z

@jdanieck I merged this PR, can we add more tests in the next PR?

* Add initial version of u1 type support.
* Turn off u8_to_u1 test in IE.CPU.
* Fix compilation issue.
* Replace std::memset with std::fill.
* Add u4 type support.
* Add i4 support.
* LP types support generalized.
* Remove std::copy optimization.
* Fix backend test for LP types.
* Fixed arm plugin compilation.
* Add LP types to Serialization SLT.
* Add Convert to summarize.py report.

jdanieck added the category: Core OpenVINO Core (aka ngraph) label May 14, 2021

jdanieck self-assigned this May 14, 2021

Add initial version of u1 type support.

daebf97

jdanieck force-pushed the convert_lp_types branch from 9d08366 to daebf97 Compare May 14, 2021 16:26

jdanieck added 13 commits May 17, 2021 09:40

Turn off u8_to_u1 test in IE.CPU.

b85d80d

Merge remote-tracking branch 'upstream/master' into convert_lp_types

e1c4b9b

Fix compilation issue.

016d3c2

Replace std::memset with std::fill.

cfa513e

Add u4 type support.

8204c1f

Add i4 support.

548ca78

LP types support generalized.

d1f1d4a

Remove std::copy optimization.

bba7a69

Fix backend test for LP types.

760a0cd

Merge remote-tracking branch 'upstream/master' into convert_lp_types

7806b26

Merge remote-tracking branch 'upstream/master' into convert_lp_types

e57aea1

Fixed arm plugin compilation.

f58e41c

Add LP types to Serialization SLT.

19533e9

jdanieck marked this pull request as ready for review May 20, 2021 06:23

jdanieck requested review from a team, ilyachur, pelszkow and jane-intel and removed request for a team May 20, 2021 06:23

jdanieck added this to the 2021.4 milestone May 20, 2021

Add Convert to summarize.py report.

283eb19

jdanieck changed the title ~~Low precision types support in Convert reference implementation~~ Low precision types support in Convert operation May 20, 2021

ilyachur approved these changes May 21, 2021

ilyachur merged commit 9e46be7 into openvinotoolkit:master May 21, 2021

jdanieck deleted the convert_lp_types branch May 21, 2021 10:33
